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MDS convolutional codes have the property that their free distance is maximal 
' among all codes of the same rate and the same degree. In this paper we introduce a class 

of MDS convolutional codes whose column distances reach the generalized Singleton 
bound at the earliest possible instant. We call these codes strongly MDS convolutional 
J> , codes. It is shown that these codes can decode a maximum number of errors per time 

interval when compared with other convolutional codes of the same rate and degree. 
These codes have also a maximum or near maximum distance profile. A code has a 
maximum distance profile if and only if the dual code has this property. 

Keywords: MDS codes, convolutional codes, column distances, feedback decoding, 
superregular matrices. 



*The research of this paper was presented at the 2002 IEEE International Symposium on Information 
Theory in Lausanne, Switzerland, June 30-July 5, 2002 and at the International Symposium on the Math- 
ematical Theory of Networks and Systems (MTNS), University of Notre Dame, August, 12-16 2002. The 
authors were supported in part by NSF grants DMS-00-72383 and CCR-02-05310. 



1 



1 Introduction 



In comparison to the literature on linear block codes there exist only relatively few algebraic 
constructions of convolutional codes having some good designed distance. There are even 
fewer algebraic decoding algorithms which are capable of exploiting the algebraic structure 
of the code. 

Convolutional codes are typically decoded via the Viterbi algorithm which has the advan- 
tage that soft information can be processed. This algorithm has however the disadvantage 
that it is too complex for codes with large degree or large memory or when the block length 
is large. The algorithm is also not practical for convolutional codes defined over large alpha- 
bets. There are some alternative sub-optimal algorithms such as sequential decoding and 
feedback decoding. All these algorithms do not in general exploit the algebraic structure of 
the convolutional code. 

In applications where codes over large alphabets are required the codes of choice are 
linear block codes with large distance such as Reed-Solomon codes and more general algebraic 
geometric codes. These codes can be algebraically decoded using e.g. the Berlekamp-Massey 
algorithm or some of its generalizations. 

In this paper we introduce a new class of convolutional codes which we call strongly 
MDS convolutional codes. These codes are particularly suited for applications where large 
alphabets are involved. The free distance of these codes reaches the generalized Singleton 
bound. This is the maximal possible distance a convolutional code of a certain rate and 
degree can have. The number of errors that strongly MDS convolutional codes can correct 
per time interval is in a certain sense maximal as well. We will make this precise in Sectional 

Let F be any finite field and denote by ¥[D] and ¥((D)) the polynomial ring respectively 
the field of all formal Laurent series over F, i. e. 

L oo 

¥[D] = {J2 a i Dj L G N °' a i G F } and F (( D )) = { a i Dj 1 G Z ' a i G F }' 

3=0 j=l 

For v = J2JLi VjD 3 G ¥([D)) n \{0} we define V := min{j G Z | Vj ^ 0} to be the delay of the 
sequence v, that is the time instant, at which the sequence actually starts. We put := oo. 

Let G G F[D] fcxn be a k x n polynomial matrix of rank k. We define a convolutional code 
of rate k/n as the set 

C := {uG | u G ¥((D)) k } C ¥((D)) n (1.1) 

and say that G is a generator matrix of the code C. Two generators of C differ only by a 
nonsingular left transformation over ¥((D)). It is well-known that we can assume G to be 
basic and minimal in the following sense. 

Definition 1.1 (see [2|) A polynomial generator matrix G G F[D] fcx " is called basic if it 
has a polynomial right inverse (equivalently, if the k x fc-minors are coprime in F [£)]). It 
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is called minimal if X]i=i U U where z/j denotes the ith row degree of G, attains the minimal 
value among all generator matrices of C. 

Two basic generator matrices differ only by a unimodular left transformation over ¥[D]. 
If G is a minimal basic encoder one defines the degree of C as the number 5 := 52f=i u i- 
In the literature the degree 5 is sometimes also called the total memory [T2] or the overall 
constraint length [5] or the complexity of the minimal basic generator matrix G(D). We 
like to use the term degree as it corresponds to the term McMillan degree used in systems 
theory [HI EH HHj • We also wish to point out that in algebraic geometry the degree corresponds 
to the degree of an associated vector bundle (i.e. quotient sheaf), see [1211201122] for more 
details. 

Since the degree depends only on the code itself, but not on the specific choice of the 
generator matrix G, we will call 5 the degree of the code C. Recall also from Forney [3] 
that the set {z/ 1; . . . , of row degrees is the same for all minimal basic encoders of C. 
Because of this reason McEliece jTJ] calls these indices the Forney indices of the code C. As 
a consequence, also the number v := max{z/i, . . . , depends only on the code C itself and 
is usually called the memory of the code. In the sequel we will adopt the notation used by 
McEliece [T51 p. 1082] and call a convolutional code of rate k/n and degree 5 an (n, k, S)- 
code. Every (n, k, <5)-code C can also be represented in terms of a parity check matrix, i. e. 
a matrix H G W((D)) [n ~ k)xn such that 

C = {v G ¥((D)) n | vH T = 0}. 

It is clear that we can choose H to be polynomial, thus H 6 F[D]( n ~ fc ^ xri , and basic. Notice 
also that GH T = for any generator matrix G of C. 

For a vector v e F n , we define its weight wt(v) as the number of all its nonzero compo- 
nents. For v = J^JLi VjD-i G F([D)) n we define 

00 

wt(v) := y^wtfa) G N U {00}. 

3=1 

Finally, the free distance of the convolutional code C C ¥((D)) n is defined through 

d liee := min{wt(w) | v G C,v ^ 0}. (1.2) 

It is an easy, but crucial observation that a basic generator matrix G yields a non-catastrophic 
and delay-free encoder, i. e., if v — uG G F([D)) n for some u G F((D)) fc , then 

wt(t> ) finite =^ wt(n) finite 



and 



Therefore, in case we are given a basic generator matrix G, the free distance can also be 
obtained as 



! frco = mm 



= mm 



[wt(v) | v = uG for some u E ¥[D] k \{0}} 
|wt(f) | v = uG for some u G ¥[D] k \{0},u ^ 0}. 



An (n, k, 5) convolutional code is called MDS if its free distance is maximal among all 
rate k/n convolutional codes of degree 5, i.e. an (n, k, 6) convolutional code is MDS if the 
free distance achieves the generalized Singleton bound [22J: 



The concept of MDS convolutional codes was introduced by the authors in [221 EH] • Strongly 
MDS codes are going to be a subclass of MDS codes which have a remarkable decoding 
capability. 

The paper is structured as follows: In Section |2] we review notions from convolutional 
coding theory such as the column distances, the generalized Singleton bound and we intro- 
duce the important concepts for this paper, namely the property of being strongly MDS 
and having a maximum distance profile. In Section |3] we show the existence of strongly 
MDS codes in the situation when the rate is (n — l)/n. In order to do so we introduce 
the interesting concept of a superregular matrix which might be of independent interest. In 
Section |U we illustrate the concepts through a series of examples. In Section we investigate 
to what extend properties of MDS, strongly MDS and maximum distance profile carry over 
to the dual code. The main result of this section states that a code has a maximum distance 
profile if and only if its dual has this property. This allows us then to show that for certain 
specific parameters a code is strongly MDS if and only if its dual is strongly MDS. Finally 
in Section |H1 we show how strongly MDS convolutional codes can be decoded via feedback 
decoding. It turns out that the number of errors which can be decoded per time interval 
compares well to a maximum distance separable block code. 



2 Strongly MDS Codes and Codes with Maximum Dis- 
tance Profile 



In this section we will recall the column distances of a convolutional code and their relation 
to the free distance. After showing some upper bounds for these distances we will introduce 
the notion of strongly MDS codes. It describes codes, for which the column distances attain 
their maximum value. 

Throughout this section let C C ¥([D)) n be an (n, k, 5)-code with basic generator matrix 



'free 





(2.1) 
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and basic parity check matrix 



H = H i Dj e ¥[D] (n - k)xn , Hj G W {n ~ k)xn , H ^ ^ 0. 

j=0 



(2.2) 



Notice that v is the memory of the code. For every j G No we define the truncated sliding 
generator and parity check matrices 



G) 



Gq Gi 
Gq 



Ho 

Hi H 

H 3 H j~l 



Go 



Hn 



g ]pO'+i)fcxO'+i)« 



(2.3) 



£ ]pO'+l)("-fc)x(j+l)n 



where we let Gj = (resp. Hj = 0) whenever j > v (resp. j > /x), see also |9, p. 110]. The 
identity GH T = immediately implies Gj(HjY = for all j 6 No. Since G and H are both 
basic, the matrices Gj and Hfj both have full rank and therefore we even have 



{uG C j | u G ¥ u+1)k } = {ve F (j+1)n | v{H^f = 0} for all j G N c 



(2.4) 



The relevance of these matrices rests on the fact that they single out codeword sequences of 
length j in the following sense. 

Remark 2.1 For v := J2T=i v j Di e H( D T and m,M eZ with m< M define 



V[m,M] 



(v m ,v m+1 , . . .,V M/ 



Then we have the following: 

(a) If v = uG for some u G ¥((D)) k with Xl > I, then = up ( j+j]Cr5 and v^j+^H^Y = 
for all j G No. 

(b) If v = uG C j G F^ +1 ' n for some j G No and u G F^ +1 ) fc , then there exists v G C such that 
V > and V[oj] = v. 

(c) For all j G N we have {v m \ v G C, V = 0} = {v = (v , ...,Vj) G F^' +1 ) n | v{H c J = 
0, v ± 0}. 

Part (a) follows easily by equating like powers of D in the equation v = uG and by use 
of (|2.4j) : (b) is obvious by taking u = uG with it = YH=o ^iD' where w = (uo, • • • , (c) is 
a consequence of (a) and (|2.4j) . 



5 



Following pp. 110], the jth column distance of the code C is denned to be 

dj := min {wt(v [QJ] ) | v E C, V = 0}. (2.5) 

Using the remark above we obtain the alternative identities 

d c j = min {wt(v[ j]) \ v = uG, u E ¥[D] k , u ^ 0} 
= min {wt(f [i,i+j]) | v = uG, u E ¥([D)) h , V = Z} 

= min {wt ((m , . . . , | m E ¥ k , u ^ 0} (2.6) 

= min{wt(£) | v = (v , ...,Vj)E ¥ u+1)n , v{H c J = 0, v ^ 0}. (2.7) 

Obviously, < d {tee for all j E N and one even has [HI pp. 113] 

— ^i — ^2 • • • an d hm = c? frcc . (2.8) 

The (z/ + l)-tuple of numbers (d^, d±, d\, . . . , d°), where v is the memory, is is called the 
column distance profile of the code Pp. 112]. 
Equation (|2.7|) immediately implies 

Proposition 2.2 Let d E N. Then the following properties are equivalent. 

(a) dj = d; 

(b) none of the first n columns of Hj is contained in the span of any other d — 2 columns 
and one of the first n columns of is in the span of some other d — 1 columns of that 
matrix. 

We leave it to the reader to verify the equivalence of the statements. 
Proposition 2.3 For every j E No we iave 

d)< (n-k)(j + + 

PROOF: Consider the sliding parity check matrix H? introduced in (|2.3j) . The set of 
vectors v = (vq, . . . ,Vj) E ¥^ +1 ' n , v{H^Y — 0, vq ^ forms a nonlinear subset of the linear 
block code defined by the left kernel of Hj. Since E f( n -*)u+ 1 ) xn u+ 1 ) an y vector in the 
left kernel has weight at most (n — k)(J + l) + l by the usual Singleton bound for block codes 
and this establishes the claim. □ 

The column distances give information about the error- correcting capabilities of the code. 
Precisely, d^ determines the error-correcting capability of a decoder that estimates the mes- 
sage symbol u based on the received symbols f[oj], see also |9, p. 111]. Therefore, a good 
performance for sequential decoding requires the column distances as big as possible. The 
next proposition shows that maximality of dj implies maximality of the preceding column 
distances. 
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Corollary 2.4 If d 

all i < j. 



(n — k)(j + 1) + 1 for some j G No, then d\ = (n — k)(i + 1) + 1 for 
PROOF: It suffices to prove the assertion for i = j — 1. In order to do so notice that 





" 











_ H j Hj-i ■■■ Hi 


H _ 



and assume that one of the first n columns of H^_ x is in the span of some other (n — k)j — 1 
columns. Then rank/fo = n — k implies that one of the first n columns of H!j is in the 
span of some other (n — k)j — 1 + n — k = (n — k)(j ' + 1) 
contradiction to the optimality of d c - by Proposition 



1 columns of H?. But this is a 

□ 



The Singleton-bound for block codes has been generalized to convolutional codes in |22j . 
Therein the following has been shown. 



Theorem 2.5 The free distance of an (n, k, S)-code satisfies 



k)([i\ + l) + S + l. 



(2.9) 



The number appearing on the right in (|2.9J1 is called the generalized Singleton bound. The 
code is called an MDS code if it satisfies d bcc = (n — k) ([Ij + 1) +5+1. It has been shown 



in [22] that for every set of parameters (n, k, 5) and every prime number p there exists a 
suitably large finite field F of characteristic p and an MDS code with parameters (n, k, 5) 
over F. 

The generalized Singleton bound reduces to the usual Singleton bound n — k + 1 when 
5 = 0, the block code situation. 

The proof of the existence of MDS codes given in [221 is based on techniques from algebraic 
geometry and is non-constructive. In j2Hj a construction of MDS codes with parameters 
(n, k, 8) was given for suitably large fields of characteristic coprime with n. 

In the sequel we will strengthen the MDS property by requiring that the generalized 
Singleton bound is attained by the earliest column distance possible. This will lead to the 
notion of a strongly MDS code. 

Proposition 2.6 Suppose C be an MDS code with parameters (n,k,5), column distances 
dj, j 6 Mo, and free distance d bac . Let M := min{j e No | dh = d bee }. Then 



M > 



6 




r 5 i 




+ 


n — k 


-k. 
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PROOF: From Proposition 12. 31 we get 



(n — k) 



l)+S+l = d c M < (n-k)(M+l) + l. 



(2.10) 



□ 



This yields the assertion. 

The proof also shows that in the case j > |_f J + [^3^1 the column distance dj never 
attains the upper bound (n — k)(j + 1) + 1 of Proposition 12.31 see also (|2.8j) . 

Definition 2.7 An (n, k, 5)-code with column distances dj, j G No, is called strongly MDS, 
if 



d 



M 



(n — k) 



+ 1 + 5 + 1 for M 



5 




r 5 i 




+ 




X 


n — k 



Because of (12. 8|) the strong MDS property implies the MDS property. 

Remark 2.8 In the case where (n — k) \ 5, the strong MDS property implies that d c M attains 
the upper bound (n — k)(M + 1) + 1, see Proposition 12.21 Hence Corollary 12.41 implies that 
in this case all column distances attain their optimal value. 

If (n — k) \ 5, we always have d c M < (n — k)(M + 1) + 1 as can be seen from (J2.1U)) . 

Even when (n — k) \ 5 it is very desirable that the column distance profile d$, dl, dl, . . . 
has the maximum possible increase at each step. This motivates the following definition. 



Definition 2.9 Let 



L :-- 



5 




5 




+ 




-k. 


.n — k. 



(2.11) 



An (n, k, <5)-code with column distances dj, j G No, is said to have a maximum distance 
profile if 

d) = (n-k)(j + + for j = l, 
Using the notation of Definition 12.71 we have 



L. 



M if (n - k) | 8 
M — 1 otherwise. 



(2.12) 



An immediate consequence of Corollary 12.41 is 



Lemma 2.10 An (n, k, 8)-code has a maximum distance profile if and only if the Lth column 
distance satisfies 

(n - k)(L + I) + 1. 



As a consequence we obtain that if n — k divides 5 then an (n, k, <5)-code has maximum 
distance profile if and only if it is strongly MDS since (n — k) ( I f J + + l) + 1 = (n — 



S 



Remark 2.11 The concept is clearly related to the notion of optimum distance profile 
(ODP), see [3 p. 112]. For ODP it is required that the column distances are maximal 
up to the memory v. Hence if v < L then a code with maximum distance profile is always 
ODP. In general one expects a good code to have generic Forney indices, i.e. the indices 
attain only the two values |~|] and |_f J • McEliece [THl Corollary 4.3] calls such codes compact 
codes. It has been shown in [22] that an MDS code has always generic indices. Of course if 



the indices are generic then v = |~|] and thus v < L + 1. 

The notion of ODP seems also to be dependent on the base field which is usually assumed 
to be the binary field. A code with maximum distance profile does in general not exist over 
the binary field and it can only exist for sufficiently large base fields. This is similar to the 
situation of MDS block codes. Such codes are known to exist as soon as the field size of F 
is larger than the block length n. 

One of the main results of Section will show that a convolutional code has a maximum 
distance profile if and only if its dual has this property. The following algebraic criterion 
which characterizes codes having a maximum distance profile will be very useful. 

Theorem 2.12 Let G = 2~^J=o GjD J be the generator matrix of an (n, k, 5)-code. Let L be 
defined as in (|2.11|) and let 



G 



c 



Go G\ ... Gl 

Gq ... Cr£_i 

Go 



jp(L+l)fcx(L+l)n_ (2.13) 



Then G represents a maximum distance profile code if and only if every (L + l)k x (L + l)k 
full-size minor formed from the columns with indices 1 < ji < . . . < j(L+i)k, where j s k+i > sn 
for s = 1, . . . , L, is nonzero. 

PROOF: Assume there are indices 1 < j\ < . . . < j(L+i)k satisfying j s k+i > sn for s = 
1, . . . , L whose corresponding minor is zero. It follows that there is a vector u = (u , . . . , ul) 
such that uG c L has zero coordinates at positions ji, . . . , j(L+i)fe- Let £ := mm{i \ Ui ^ 0}. 
Consider the vector 

(u e ,...,u L )Gl_ e e¥^- i+ ^. 

The weight of this vector is at most (L — £ + l)(n — k) as there are at least (L — £ + l)k 
coordinates zero. It follows from ()2.(jj) that d c L _ e < (L — £ + l)(n — k) and by Corollary 12.41 
the code has not a maximum distance profile. 

Vice versa assume that C has not a maximum distance profile. Let m := min{i | d\ < 
{n — k){i + 1)}. It follows that there is a vector u = (uo, . . . , u m ), uq ^ such that uG c m has 
at least k{m + 1) zeros. As a submatrix inside G C L we select the columns corresponding to 
the first k(m + 1) positions where uG c m has a zero and we augment it by the last k(L — m) 
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columns of G C L . We call the indices of the selected columns jx, . . . ,j(L+i)k- This gives an 
(L + l)k x {L + l)k full-size minor and we claim that this minor is zero and that the indices 
ji, . . . ,j(L+i)k satisfy j s k+i > sn for s = 1, L. In order to prove the latter note that 
d\ — (n — k) (i + 1) + 1 for % — 0, . . . , m — 1. It therefore follows that (uo, ■ ■ ■ , Ui)G\ has at 
most k(i + l) — 1 zeros for i — 0, . . . , m— 1. In particular j s k+i > sn for s — 1, . . . , m. Clearly 
it is also true for s — m + 1, . . . , L. It remains to be shown that the minor is zero. For this 

" A B 



note that the selected matrix has the form 



C 



where A is an (m + l)k x (m + l)k 



submatrix of G c m which is singular by construction. The fullsize minor is therefore zero. □ 



3 Existence of Strongly MDS (n, n — 1, c^-Codes 

During his investigation of algebraic decoding of convolutional codes B. Allen conjectured in 
his dissertation P the existence of strongly MDS convolutional codes in the situation when 
k = 1 and n = 2. In this section we will show the existence of strongly MDS codes with 
parameters (n, n — 1,8). It follows from Equation (j2.12)l and Lemma 12. 101 that these codes 
also have maximum distance profile. By Theorem 12.51 the generalized Singleton bound for 
these parameters is given by L^rrJ +5 + 2. Thus, Definition 12 . 71 yields that we have to find 
an (n, n — 1, <5)-code such that d c M = M + 2, where M = [^ZiJ + 5. In order to do so, let 



H — [ax, . . . , a n ] e ¥[D] n , where Oj 



j=0 



(3.1) 



be a basic parity check matrix of the desired code. Without loss of generality we may assume 
a 10 = 1. The strong MDS property can now be expressed as follows. 

Theorem 3.1 Let H G ¥[D] n be as in (J33J), let a 10 = 1 and define C := {v G ¥((D)) n | 
vH T = 0} be the code with parity check matrix H. Furthermore, for i — 2, . . . , n let 



a i 



3=0 



(3.2) 



be the Laurent expansion of ^- G ¥(D) and for M 



— J + 5 define 



H :-- 



h()2 ■ ■ ■ ho n 
h\2 ■ ■ ■ hi n 



hMi ■ ■ ■ h 



Mn 



02 



h 



Af-1,2 ' ' 



t-On 



lM-l,n 



h 02 ■■■ h 



On 



(3.3) 



ei, 



eM+i, H 



12- 



H\n, ■ ■ ■ ,H_ 



M+1,2, 



] (z Jp(Af+l)x(Af+l)n^ ^3 4^ 
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where denotes the i-th standard basis vector. We call H the M-th systematic sliding 
parity check matrix of C. The following conditions are equivalent: 

(a) C is strongly MDS, i. e. d c M = M + 2, 

(b) none of the columns Hi 2 , ■ ■ ■ , H\ n of H is contained in the span of any other M columns 
ofH. 

Notice that (b) automatically implies that hji 7^ for all i and j and that also the first 
column e± is not in the span of any other M columns. 

PROOF: After a column permutation the sliding parity check matrix e jr( M + 1 ) x ( M + 1 )™ 
of C has the form 



1 

on 



a u 1 



020 

0-2M 



0>nO 



^20 



Q>nM G2,M-1 



On,M-l 



^20 



0"nO 



It is straightforward to see that left multiplication of H' by the inverse of the first block 
leads to the matrix H of ()3.3|) . After these transformations Proposition 12.21 applied to the 
case j = M and d = M + 2 translates into the equivalence: C is strongly MDS iff neither the 
first column e± nor any of the columns H12, . . . , H\ n is in the span of any other M columns 
of H. But this in turn is equivalent to (b) above. □ 
In order to establish the existence of strongly MDS codes we will proceed as follows. 
Firstly, we will establish the existence of a systematic sliding parity check matrix H as 
in (jSISj) with property (b) of the theorem above. Thereafter, we will show that there exist 
coprime polynomials having maximum degree equal to 5 such that 

M 

— = y hjiD^ + higher powers, i = 2, . . . , n. 

Theorem 13. II then yields that the code with parity check matrix H = [a\, . . . , a n ] is a strongly 
MDS (n,n- l,5)-code. 

As for the first step, let us have a look at the special case of (2, 1, <5)-codes. In this case 
M = 25 and the systematic sliding parity check matrix in (j3.3J) has the form 



H :-- 



h 
h h 



h 



26 



... 

'•• 

hi h 



[I 2S +i,T) e F( 25+1 ) x ( 45 + 2 ), where hj G F. 



(3.5) 

As we will see, the existence of matrices T of any given size and the structure above such 
that H has the column property of Theorem 13.1T b) will be the main tool for the existence of 
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strongly MDS codes even of length n > 2. Therefore we will concentrate on these matrices 
first. The main point is to express the column condition on H in terms of the minors of T. 

Definition 3.2 Let R be a ring. For a matrix T £ R nxk denote by £ R rxs the 

r x s-submatrix obtained from T by picking the rows with indices ii, . . . ,i r and the columns 
with indices ji, . . . , j s . 

In the sequel the following property will play a crucial role. 

Definition 3.3 Let F be field. A lower triangular matrix T £ F™ xfc is said to be superreg- 
ular 1 , if T^ 1 ' is nonsingular for all 1 < r < min{fc, n} and all indices 1 < i\ < . . . < i r < 
n , 1 < ji < ■ ■ ■ < jr < k which satisfy j u < % v for v = 1, . . . ,r. We call the submatrices 
obtained by picking such indices the proper submatrices and their determinants the proper 
minors of T. 

Remark 3.4 Observe that the proper submatrices are the only submatrices which can pos- 
sibly be nonsingular. This can be seen as follows. If j u > i v for some u, then in the submatrix 
T := Tj*'"''j r r the upper right block consisting of the first v rows and the last r — u+ 1 columns 
is identically zero. Hence the first v rows of T can have at most rank v — 1. In other words, 
the improper submatrices of T are trivially singular. For example, for T = (hij) we have 



1,3,4 — 



Tin 





" 










Ai 


^•53 


^54. 



Now we can establish the following. 
Theorem 3.5 Let ¥ be a field and T be a lower triangular Toeplitz matrix, i. e. 

£ F' x/ . (3.6) 

Furthermore, put H := [h,T] = [e\, . . . , ei, Ji, . . . , Tj\ £ F /x2/ . Then the following are 
equivalent: 

(a) T is superregular, i.e. all proper submatrices in the sense of Definition \3.3\ are nonsin- 
gular. 

(b) Assume there are indices 1 < i\ < . . . < i r < n, 1 < j'2 <•••<>< k. Then all proper 
submatrices ofT of the form T 1 n j* 2 '"^ r are nonsingular, 

1 We adopt this notion from [23 , where it has been coined in a slightly different context. 



T=[T u ...,Ti\ 



h 
hi h 



h 



i-i 



'■■ 

hi h 
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(c) wt [Ti + J2j=i PjTmj) > I — s for all 1 < s < I — 1, all 1 < mi < . . . < m s < I and all 

(d) T x ^ span{T mi , . . . , T ms , e h , . . . , e k } where 1 < mi < . . . < m s < I and 1 < l x < . . . < 
It < I and s + t <l-l. 

(e) Ifve W 21 satisfies vH T = and v i+1 ^ 0, then wt(v) > I + 1. 

(f) e 1 ^ span{T mi , . . . , T mal e hl . . . , e k } where 1 < m 1 < . . . < m s < I and 1 < h < . . . < 
l t < I and s + t < I - 1. 

(g) Ifve F 21 satishes vH T = and v% ^ 0, then wt(v) > I + 1. 

PROOF: (a) <^> (b) is obvious since in case of properness the Toeplitz structure implies 



mil,...,l r 
jl )•••>> 



rpil— jl+l,...,i r — 

ii-ji+ivjV-ji+i • 



(b) ^» (c): Let h :— T x + Ylj=i fijT mj and assume to the contrary wt{K) < I — s. The 
assumption implies that h consists of at least s+1 zero entries, say at the positions ii, . . . , i s +i. 
Then 



rpi\,...,i s + l 

\,m\,...,m t 







(3.7) 



The superregularity yields m v > i u+ \ for some v e {1, . . . , s}, which we can choose to be 
minimal with this property. Then the submatrix Tml"" %v m] is identically zero and therefore 
we obtain from ()3.7|) the identity Tj 1 ^"'^ m (1, Pi, ... , (3 u -iY = 0, a contradiction to super- 
regularity since by minimality of v this coefficient matrix is nonsingular. 
(c) =^ (b): Assume to the contrary that det T 1 ^' i ''* s +^ s = for some indices satisfying 
m„ < for v = l,...,s. We can assume s to be minimal with this property. Then 
there exists {(3 , fix, ... , (3 S ) G F s+1 \{0} such that T^f 3 ^ (/3 , . . . , f3 s f = 0. Minimality of 
s and the equivalence of (a) and (b) imply /3 ^ 0. Hence we can take (3q = 1 and (j3.7|) is 
satisfied. Thus wt(T! + Ylj=i Pj^m,) < / — (s + 1), a contradiction. 
The properties (d) and (e) are simply reformulations of (c). 

The equivalence (d) -v^ (f) is clear from the structure of H (a linear combination of T\ by 
the other columns of H has to involve the column e\ and vice versa). 

The property (g) is a reformulation of (f). □ 
The equivalence of (e) and (g) immediately implies 

Corollary 3.6 IfT 6 F' x ' is a superregular lower triangular Toeplitz matrix, then so is T -1 . 

The following lemma is the main step for establishing the existence of superregular ma- 
trices of Toeplitz-structure. 
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Lemma 3.7 Let ¥ be a field and X 1} . . . , Xi be independent indeterminates over ¥. Define 
the matrix 

OTi ••• 



A : = 



e¥(X 1 ,...,X l ) lxl . 



X2 X\ ■ ■ : 

■ ■-. ■-. 

jX-i • ■ ' X2 x 1 
Then A is superregular. 

PROOF: We proceed by contradiction. Assume there exists a singular proper submatrix 

i:=4V"'t- 

We can take the size r to be minimal. Then certainly r > 1. By properness we know that 
j v < i v for v — 1, . . . , r. 

Notice that for /j, < v the entry of A at the position {y, //) is given by A 1 ^ = X^_^ +1 . Hence 
the indeterminate with the largest index appearing in A is X ir _j 1+ i. It appears only once in 
the matrix and that is in the lower left corner. Thus its coefficient in det A is ± det A, where 

4 — 4'i.-,v-i 

Singularity of A now implies det A = 0. By minimality of r this yields that A is an improper 
submatrix of A, i. e. there exists an index r G {2, . . . , r} such that j T > i r _i. Picking r 
minimal we get %\ < ... < i r _i < j T < . . . < j r and therefore the first r — 1 rows of A have 
the form 

"* ••• * ••• 0" 



* 







where the block of possibly nonzero elements consists of r — 1 columns. Hence A is a 
blocktriangular matrix and we have 

= det A = det AV'-'V" 1 det A^-'r 

Since both factors are proper minors we get a contradiction to the minimality of the size r. 

□ 

The following consequence is standard. 

Theorem 3.8 For every I G N and every prime number p there exists a finite field ¥ of 
characteristic p and a superregular matrix T G ¥ lxl having Toeplitz structure. 

Proof: Consider the prime field ¥ p and the matrix of the previous lemma with entries 
in ¥ p (Xi, . . . ,Xi). All its proper minors are nonzero polynomials in F p [Ai, . . . ,Xi\. Over 
an algebraic closure ¥ p a point a := (ai, . . . , a/) G can be found such that none of the 
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minors vanishes at a. Hence the Toeplitz matrix T having (ai, . . . , a/) T as its first column is 
superregular. Since each is algebraic over F p , the matrix T has its entries in a finite field 
extension F of F p . □ 
In particular, for every size I G N there exist superregular Toeplitz matrices over a field 
of characteristic 2. Unfortunately, the theorem above is nonconstructive and it is not at all 
clear what the minimum field of characteristic 2 is to allow a superregular Toeplitz matrix 
of given size / x I. We present some examples. 

Example 3.9 (1) Using a computer algebra program one checks that the following matrices 
are superregular. The first examples are all over prime fields F p . 



1 0' 
1 1 



G F 



2x2 



1 0' 

1 1 

2 1 1 



G Ff 3 , 



1 











1 


1 








2 


1 


1 





1 


2 


1 


1 



G F 



4x4 



1 














2 


1 











1 


2 


1 








6 


1 


2 


1 





4 


6 


1 


2 


1 



G Fj 



5x5 



1 

















2 


1 














1 


2 


1 











1 


1 


2 


1 








3 


1 


1 


2 


1 





4 


3 


1 


1 


2 


1 



G F^ x6 , 



1 




















7 


1 

















13 


7 


1 














2 


13 


7 


1 











1 


2 


13 


7 


1 








4 


1 


2 


13 


7 


1 





14 


4 


1 


2 


13 


7 


1 



G F£ 7 . 



The following examples represent superregular matrices over finite fields of characteristic 
2. For this assume that a, (5 and 7 satisfy 

a 2 + a + 1 = 0, /3 3 + /3 + 1 = 0, and 7 4 + 7 + 1 = 0. 

Then the following matrices represent superregular matrices over F 4 , F 8 and Fi 6 respec- 
tively. 



'1 

a 1 
1 a 1 



G F 3 2 x3 , 



Assume e, uj satisfy 



G F^ 5 , 



1 

(3 1 

P 1 

/3 (3 3 (3 1 
1 /3 (3 3 (3 1 



e & + e z + 1 = and u b + + 1 



1 






7 


1 




7 5 


7 


1 


7 5 


7 5 


7 


7 


7 5 


7 5 


1 


7 


7 5 


1 = 


0. 





G F^ 4 x6 . 
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Then the following matrices represent superregular matrices over F 32 and F 64 respectively. 



1 

e 1 

e 6 e 

e 9 e 6 

e 6 e 9 

e e 6 

1 e 



e 

^6 



G F 



7x7 
25 5 



1 























u; 


1 




















cu 9 


UJ 


1 

















UJ 33 


UJ 9 


UJ 


1 














uj 33 


to 33 


UJ 9 


UJ 


1 











uj 9 


UJ 33 


UJ 33 


UJ 9 


UJ 


1 








UJ 


CO 9 


UJ 33 


UJ 33 


UJ 9 


UJ 


1 





1 


UJ 


UJ 9 


UJ 33 


UJ 33 


UJ 9 


UJ 


1 



G F* 6 x8 . 



Notice that the matrices above have even more symmetry than required. One can easily 
show that there is no superregular 4 x 4-matrix over F4 of general Toeplitz structure. 
However, the above suggests to ask whether one can find for every / > 5 a superregular 
I x /-Toeplitz matrix over F 2 i-2. 

(2) In the appendix we prove that for every n G N the proper minors of the Toeplitz-matrix 

-(V) 

(V) ("«') 



T 



e Z" 



.CCD ■■■ (V) Co 1 ). 

are all positive. Hence for each n G N there exists a smallest prime number p n such that 
T n is superregular over the prime field F Pn . One can check that 

P2 = 2, p 3 = 5, pi = 7, p 5 = 11, p 6 = 23, p 7 = 43. 

Now we can establish the existence of strongly MDS codes in the following sense. 

Theorem 3.10 For every n, 5 G N and every prime number p there exists a strongly MDS 
code with parameters (n,n — 1,5) over a suitably large Geld of characteristic p. 

The proof of this theorem is rather long and technical and because of this reason it is 
put into the appendix. 

There is of course the natural question if strongly MDS convolutional codes and codes 
with maximum distance profile exist for all parameters (n,k,5). We strongly believe so. 
The section showed that such codes exist for all parameters (n,k,5) with k — n — 1. For 
all small values of (n, k, 5) we have found strongly MDS convolutional codes and codes with 
maximum distance profile making computer searches. In the next section we present a series 
of examples of such codes found through computer searches. Based on this wealth of data 
we conjecture: 

Conjecture 3.11 For all n > k > and for all 5 > there exists an (n, k, 5) code over a 
sufficiently large field which is both strongly MDS and has a maximum distance profile. 
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4 Examples 



In this section we will present some examples of strongly MDS codes with small parameters. 
The first set of examples is constructed according to the proof of Theorem QQH by utilizing 
the superregular matrices in Example 13.91 

Example 4.1 Recall the first part of the proof of Theorem 13.101 

(1) We can construct strongly MDS (2, l,5)-codes once a r x r superregular matrix, where 
t = 25 + 1, is available. Thus, the 5x5 and 7x7 matrices given in Example 13.9( 1) 
lead to the strongly MDS (2, 1, 2)-code over F 8 (where /3 3 + (3 + 1 =0) with parity check 
matrix 

H = [a, b] = [1 + (3 2 D + (3 5 D 2 } 1 + ^D + (3 5 D 2 } £ ¥ S [D} 2 

and to the strongly MDS (2, 1, 3)-code over F 32 (where e 5 + e 2 + 1 = 0) with parity check 
matrix 

H = [a, b] = [l + e 18 D + e n D 2 + e 29 D 3 , 1 + D + e 27 D 2 + e 18 D 3 ] 6 ¥ 32 [D] 2 . 

Indeed, one checks that 

1 + 3 4 D + 6 5 D 2 

l + f3 2 D + f3 , D 2 = ^ + PD + (3 3 D 2 + (3D 3 + + higher powers 

and 

1 + D + e 27 D 2 + e 18 D 3 



1 + e 18 D + e u D 2 + e 29 D 3 



1 +eD + e 6 D 2 +e 9 D 3 + e 6 D 4 + eD 5 + D 6 + higher powers. 



Hence the free distance of the two codes above is 6 (resp. 8), and this is also the 4th 
(resp. 6th) column distance. 

(2) Using the 8 x 8-superregular matrix of Example I3.9f 1). one can construct a strongly 
MDS (3, 2, 2)-code over F 64 . Hence the code has free distance equal to its 3rd column 
distance, and this value is 5. Using the construction of the proof of Theorem 13.101 and 
going through some tedious calculations in the field F 64 (where u; 6 + uj + 1 = 0) one 
finally arrives at the parity check matrix 

H = [1 + co 57 D + u 62 D 2 , u + u u D + u 54 D 2 , 1 + uj 11 D + u 21 D 2 ] e ¥ 3 4 . 

(3) A strongly MDS (4, 3, l)-code has free distance 3 and this is identical with the first column 
distance. It can be obtained from a 6 x 6-superregular matrix using the construction of 
the proof of Theorem 13.101 Indeed, the matrix 



H 



1 
1 



7 5 7 1 

1 7 7 5 7 5 7 1 
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has been obtained from the superregular Toeplitz matrix of Example I3.9f 1) and thus it 
satisfies property (b) of Theorem 13.11 Hence a parity check matrix of a strongly MDS 
(4, 3, l)-code over F 16 (where 7 4 + 7 + 1 = 0) is given by 

H = [1, 7 5 + D, 7 + <yD, 1 + 7 5 £] e ¥ 16 [D] 4 . 

(4) Of course, not every MDS code is strongly MDS. For instance, the code with parity check 
matrix H = [10 + 3D + 2D 2 , 4 + 2D + D 2 ] G F n [D] 2 is an MDS code, but not strongly 
MDS. In this example, the MDS property follows from the fact, that this code is the 
result of the construction of MDS codes as presented in [2E]. However, a (2, 1, l)-code is 
strongly MDS iff it is an MDS code. This can be checked directly by using Theorem 13.11 
and the fact that for the (basic) parity check matrix [ao + aiD, &o + b\D] of an MDS code 
all coefficients as well as a^bi — a^o are nonzero. 

The next series of examples has been found by completely different methods. They are 
all cyclic convolutional codes in the sense of [U El EH lEj- In those papers convolutional 
codes having some additional algebraic structure are being investigated. This additional 
structure is a generalization of cyclicity of block codes but is a far more complex notion for 
convolutional codes. In particular cyclicity of convolutional codes does not mean invariance 
under the cyclic shift in F((D)) n . We will not go into the details but rather refer to [U 
E]- However, in order to understand and test the following examples there is no need in 
understanding the concept of cyclicity for convolutional codes since below we provide all 
information needed to specify the codes. We present the generator matrices and also provide 
all column distances; they have been computed with a computer algebra program. All 
matrices given below are minimal basic in the sense of Definition 11.11 We would like to 
mention that just like for cyclic block codes, the length of the code and the characteristic of 
the field have to be coprime. Therefore, only codes with odd length are given below. 

One should note that most of the following codes exist over comparatively smaller alpha- 
bets than the examples of 14.11 However, we don't know any general construction for strongly 
MDS cyclic convolutional codes yet. But the abundance of (small) examples suggests that 
such a construction might be possible and might lead to smaller alphabets for given param- 
eters than the construction of the last section. We will leave this as an open question for 
future research. 

Example 4.2 (1) A strongly MDS (3, 1, l)-code over F 4 : 

G = [a + acD, a 2 + aD, 1 + aD]. 

The column distances are d$ = 3, d\ = 5, d c - = 6 for j > 2. 
(2) A strongly MDS (3, 1, 2)-code over F 16 (where (3 4 + (3 + 1 = 0): 

G =[/3 + (3D + D 2 , (3 e + (3D + (3 W D 2 , (3 n + (3D + (3 r °D 2 ]. 

The column distances are d$ — 3, d± = 5, d% = 7, dj — 9 for j > 3. 
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(3) A strongly MDS (3, 2, 2)-code over Fi 6 : 



G 



' (3 5 + (3 4 D (3 3 + (3 8 D (3 9 + (3 2 D 
(3 9 + (3 12 D (3 5 + (3 u D (3 3 + (3 3 D 



The column distances are = 2, d\ = 3, d c 2 = 4, dj = 5 for j > 3. 
(4) A strongly MDS (5, 1, l)-code over F 16 : 



G=[(3 + (3D, (3 13 + (3 l0 D, (3 10 + (3 A D, (3 7 + (3 l3 D, (3 A + (3 7 D\. 



The column distances are c?g = 5, dl — 9, dj = 10 for j > 2. 

(5) A strongly MDS (5, 1, 2)-code over F 16 : 

G = [(3 + (3 A D + (3D 2 , (3 7 + (3D + (3 10 D 2 , (3 13 + (3 13 D + (3 A D 2 , 

(3 4 + (3 10 D + (3 13 D 2 , (3 10 + (3 7 D + f3 7 D 2 ]. 

The column distances are d$ = 5, d\ = 9, d c 2 = 13, d'j = 15 for j > 3. 

(6) A strongly MDS (5,2,2)-code over F 16 : 



The column distances are c?g = 4, d\ = 7, dj = 9 for j > 2. 

(7) A strongly MDS (7, 1, l)-code over F 8 (where 7 3 + 7 + 1 = 0): 

G = [7 + jD, 7 3 + £>, 7 5 + 7 6 A 1 + 7 5 ^, 7 2 + 7 4 A 7 4 + 7 3 A 7 6 + 7 2 ^]- 

The column distances are d$ = 1 , d\ = 13, dj = 14 for j > 2. 

(8) A strongly MDS (7, 1, 2)-code over F 8 : 

G = [ 7 2 + lD + D 2 , 7 5 + 7 3 D + 7 6 J D 2 , 7 + 7 5 D + 7 5 L> 2 , 7 4 + D + ^D 2 , 



The column distances are d$ = 7, d\ = 13, c?2 = 18, dj = 21 for j > 3. 

(9) It is worth being mentioned that there does not exist even an MDS (7, 2, 2)-code over 
F 8 , since the generalized Singleton bound in this case is 13, but due to the Griesmer 
bound (see JSJ p. 133] for the binary case) the parameters of an (n, k, <5)-code over ¥ q 
with memory m and distance d satisfy 



G 



(3 + (3D (3 13 + (3 10 D (3 W + (3 4 D (3 7 + (3 13 D (3 4 + (3 7 D' 
1+(3 5 D (3 3 + (3 u D (3 6 + (3 2 D (3 9 + (3 8 D (3 12 + (3 U D 



1 + 7 2 D + 7 3 J D 2 , 7 3 + 7 4 D + 7 2 £> 2 , 7 6 + 7 6 L> + 7D 2 ]. 



E 



dl 



< n(m + i) for all % G Nq. 



1=0 
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Hence a (7, 2, 2)-code over Fg with memory 1 has at most distance 12. The inequality 
applied to % = 1 shows that the field size has to be at least 13 in order to allow the 
existence of an MDS (7, 2, 2)-code. 

One should notice that the codes in Example 14.2( 1) - (7) are not only strongly MDS but 
also have all column distances being optimal in the sense that they reach the upper bound 
given in Proposition 12.31 In particular they also have a maximum distance profile in the 
sense of Definition 12.91 For the (7, l,2)-code in (8), only the second column distance is not 
optimal, but rather one less than the upper bound, which is 19 in this case. 

5 The Dual of a Strongly MDS Code 

In this section we will present some results concerning the dual code of a strongly MDS 
code. The main result shows that a convolutional code has a maximum distance profile if 
and only if its dual has this property. This then implies for certain parameters that a code 
is strongly MDS if and only if its dual has this property. These results are very appealing 
as it generalizes the situation for block codes. 
Recall that if 

C = {uG | u G ¥((D)) k } = {ve ¥((D)) n | vH T = 0} C ¥((D)) n 

is an (n, k, 5)-code with generator matrix G G F[-D] fexra and parity check matrix H G 
F[.D] (n ~ fc)xr \ then the dual code, defined as 

C x = {we ¥((D)) n | wi7 = for all v G C}, 

is given by 

C L = {uH | u G ¥((D)) n - k } = {we ¥{{D)) n | wG T = 0} 

and thus an (n, n — k, 5)-code. In contrast to the block code situation almost nothing is 
known about the relation between the distances of a code and its dual. In particular, it 
has been shown in j2Hj that no MacWilliams identity relating the weight distributions of C 
and C x exists. In block code theory a very simple relation between the distances of a code 
and its dual is given in the case of MDS codes. In fact, if C is an MDS (n, fc)-block-code, 
then the dual C x is an MDS (n, n — k)-code, see |14[ Ch. 11, §2] and very specific knowledge 
on the weight enumerator and its dual is known |fT4l Ch. 11]. Therefore, it is quite natural 
to investigate whether the dual of an MDS (or strongly MDS) convolutional code is MDS 
(or strongly MDS), too. Unfortunately, this is in general not the case. 

Example 5.1 In general the dual of a strongly MDS code is not even an MDS code. This 
can be seen from the dual of the code given in Example 14.1( 3). The dual has generator 
matrix G = [1, 7 5 + D, 7 + 7D, 1 + 7 5 -D] G Fi@[-D] 4 which obviously has weight less than the 
generalized Singleton bound 8 (see Theorem 12. 5J) . 
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As we will show next the property of maximum distance profile carries over under du- 
alization. In addition, for specific code parameters the strong MDS property carries over 
to the dual code as well. To this end, recall from Definition 12.71 that an (n, k, 5)-code is 
strongly MDS if the Mth column distance attains the generalized Singleton bound where 
M = |_f J + • Thus the dual code C L is MDS if the Mth column distance attains the 
generalized Singleton bound where M = L^z^J + [fl • Obviously, these two numbers differ 
by one when k divides S but n — k does not or vice versa. What remains equal for both the 

used in Definition I2JU where we introduced 



code and its dual is the quantity L— | + 
the concept of maximum distance profile. 

Before we state the main results we need a technical lemma. 

Lemma 5.2 Let A e ¥ kxn and B e F nx ( n ~ fc ) such that 



AB = and rank A = k, rank B = n — k. 
Then the following are equivalent: 

(a) the k x k-submatrix of A consisting of the columns with indices 1 < ti < . . . < < n 
is singular, 

(b) The (n — k) x (n — k)-submatrix of B obtained by taking the rows with indices in 
{1, . . . , n}\{ti, . . . , t k } is singular. 

PROOF: Without loss of generality assume (ti, ...,£&) = (1, . . ., k) and partition A = 
(Ai A 2 ), where A\ is the k x k sub matrix under consideration. If A\ is invertible then 



ker A = colspanj; 



A- X V M 



Ln—k 



colspan F (5). 



This shows that the bottom (n — k) x (n — A;)-submatrix of B is invertible. □ 
This lemma, in conjunction with Theorem 12.121 immediately gives an algebraic criterion 
for maximum distance profile codes in terms of a parity check matrix. 

Theorem 5.3 Let H = J^j=o ^ e the parity check matrix of an (n,k,5)-code. Let L 

be defined as in (j2.11|) and let 



HI: 



H 

Hi H 



Ht, H, 



L-l 



Ho 



g J7(i+l)(n-fc)x(i+l)n 



(s.i; 



Then H represents a maximum distance prohle code if and only if every (L + l)(n — k)x(L + 
l)(n — k) full-size minor formed from the columns with indices 1 < %\ < . . . < i(L+i)(n-k)> 
where i B ( n -k) ^ sn f° r s = 1, . . . ,L, is nonzero. 
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PROOF: Let the code have generator matrix G as given in (|2.1|) . Recall that G c L (HlY — 
and both factors have full rank. By Theorem 12.121 the code has maximum distance profile 
if and only if every full size minor G C L formed from the columns 1 < j\ < . . . < j(L+i)k, 
where j s k+i > sn for s = 1, . . . , L, is nonzero. Now the complimentary minors of if£ have 
indices 1 < i\ < . . . < i(L+i)(n-k) satisfying i s ( n -k) < sn for s — 1, . . . , L. Thus Lemma I5~2l 
completes the proof. □ 

With this we have a nice duality result: 

Theorem 5.4 An (n, k, 5)-code C C ¥((D)) n has a maximum distance profile if and only if 
the dual code C L C ¥([D)) n has this property. 

PROOF: Let C have generator matrix G and parity check matrix H as given in (|2.1|) 
and ()2.2j) . Assume C has a maximum distance profile. By Theorem 15.31 every (L + l)(n — 
k) x (L + l)(n — k) full-size minor formed from the columns of iJ£ with indices 1 < i\ < 
... < i( L+ i)( n _ fc ), where i s ^ k ) < sn for s = 1, . . . , L, is nonzero. 

Consider now the dual code C L which is defined as the rowspace of the (n-k)xn matrix 
H . It follows from ()2.6J) that the Lth column distance of the dual code C 1 - is given by 

d c L = min {wt(K, . . .,Uo)HQ \ u, G F n ~ k , u + 0}. 

Taking the reversed ordering into account we obtain from Theorem 12 . 1 21 that the dual code 
C x has maximum distance profile as well. □ 

Corollary 5.5 When both k andn — k divide S then an (n, k, S)-code C C ¥([D)) n is strongly 
MDS if and only if C ± C F([D)) n has this property. 

PROOF: From k \ S and (n — k) \ 5 it follows that L = M and d c M = (n-A;)(| + l) +5+1, 
the generalized Singleton bound of the code C and d c M = k(-^x + 1) +5 + 1, the generalized 
Singleton bound of the dual code C ± . □ 

The result above gives us another class of strongly MDS codes by dualizing Theorem 13. 101 

Corollary 5.6 For every n, 5 G No such that (n — 1) | 5 and every prime number p there 
exists a strongly MDS (n, 1, 5) -code over some suitably large field of characteristic p. 

Example 5.7 (a) Corollary 15.51 tells us that the duals of the (2, 1, <5)-codes given in Exam- 
ple l^^l) are strongly MDS. But this is obviously so, since they are — up to ordering 
— identical to the given codes. 

(b) Dualizing the code of Example 14. 1( 2) gives us a strongly MDS (3, 1, 2)-code with gener- 
ator matrix 

G = [1 + u B7 D + cu 62 D 2 , u + cu M D + co 54 D 2 , 1 + co 17 D + cu 2l D 2 } G F^ 4 . 
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(c) Dualizing the codes given in Example I4.2f 2) and (3) we obtain another two strongly 
MDS codes with generator matrices 



Hi 



1 (3D + (3 9 (3 6 D + (3 8 
(3 U D (3 7 D + (3 6 (3 8 D + (3 



G F?f 



and 



H 2 = [D 2 + D + (3 2 , (3 W D 2 + D + /3 7 , (3 5 D 2 + D + (3 12 } G F? 6 . 

It is known that these codes are also cyclic convolutional codes in the sense of jl], see jU 
Thm. 7.5]. 

Finally we would like to mention that even in the case where k \ 5 and (n — k) | 5, the 
dual of an MDS code is not MDS in general. An example is given by the following code. 

Example 5.8 The (3, 1, 2)-code C C F((.D)) 3 , where F = Fie, with generator matrix 

G = [1 + (3D + (3*D 2 , (3 10 + (3 2 D + (3 A D 2 , (3 8 + (3 5 D + D 2 ] 
and parity check matrix 



H 



' 1 (3 14 D + p 2 (3 3 D + p 3 ' 
(3D (3 n D + (3 8 (3 10 D + (3 10 



is an MDS code, but not strongly MDS. It satisfies = 8 and d\ = 9. The dual code 
generated by H is not MDS. Its distance is 4. 



6 Decoding Strongly MDS Codes 

The codes discussed in the previous section have the property that they allow a very good 
feedback decoding [THj if the error distribution is reasonably mild. 

Let us briefly recall the concept of feedback decoding. Assume the codeword v = 
^A >Q VjDi G C has been sent and the word v = ^2j >Q VjD^ G ¥((D)) n has been received. 
Write v = v + e, where e = X]j>o e i-^ ) ' ? ^ s ^ ne error vector. In the j-th cycle of feedback 
decoding one corrects Vj (hence estimates 6j) and then feeds back this information into the 
decoding algorithm before proceeding with the next decoding step. It should be intuitively 
clear that the next step will benefit from the resetting v v — e^DK As for the decoding 
step itself one estimates €j upon the knowledge of the received sequence Vj, . . . The 
length I + 1, of course, depends on the distance properties of the code. This estimate will be 
correct if not too many errors have occurred on this string. 

In the sequel we will show that strongly MDS codes of rate have very good error 
correcting capabilities in terms of the maximum number of errors acceptable on each string 
without jeopardizing correct decoding. The basis of the feedback decoding algorithm is the 
following simple reformulation of the distance properties for the parity check matrices. 
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Proposition 6.1 Let C C ¥((D)) n be a strongly MDS (n,n — l,S)-code and put M : = 
L^J + 5. Let H C M E ¥(M+i)x(M+i)n be the M _ th parity check matrix Q f c and e := 

(e , • • • , e M ), e : = (e , . . . , e M ) E F M+1 . Assume 

e(H c M f = e(H c M f and wt(e), wt(e) < 

Then 

(a) e = e , 

(7?) if M is even, then additionally e 1 = e 1 . 
Notice that M is even for codes with rate 1/2. 

PROOF: Put rj := (n , . . . , rj M ) where rjj = ej - ej for all < j < M. Then n(H° M Y = 
and wt(?7) < M + 1. Thus Proposition 12.21 yields r/o = 0. As for (b) notice that if M is even, 
then wt(e), wt(e) < 4f and therefore wt(r/) = wt(?7i, . . . , t]m) < M. Now = implies 
(?7i , . . . ,T)m)(Hm-iY — and Proposition 12.21 together with Corollary 12.41 finishes the proof. 

□ 

Observe that the proposition above says that the list of M + 1 consecutive syndromes 
determines uniquely the error in the first position. This can be iterated and leads to the 
following algorithm, which at least works reasonably well for small codes. The question how 
to practically compute the error in the first position from the syndrome vector for large codes 
will be addressed at the end of this section. 

We will make use of the notation in Remark 12.11 

Theorem 6.2 Let C C ¥([D)) n be a strongly MDS (n, n—1, 8)-code with parity check matrix 
H E ¥[D] lxn and H C M E F( M + 1 ) x ( A/+1 ) ri as its M-th sliding parity check matrix. Assume the 
codeword v EC has been sent and the word v E ¥([D)) n has been received. Without loss of 
generality assume v , v > 0. Put v = v + e, thus e E F((D)) n is the error vector and assume 
that any sliding window of length (M + l)n contains at most errors, i. e. 

wt(eyj +M] ) < M ^ 1 for all j > 0. (6.1) 

Then the following algorithm will decode v correctly, i. e. for each j = 0,1,2,... we have 

V[oj] = V[o t j] after the j-th cycle: 

Put j := -1. 

Step 1: Put j := j + 1. 

Step 2: Compute the syndrome vector S := (vH T )yj +M }. 

Step 3: From the syndrome vector S determine the unique rjo E ¥ n , such that S = n{H c M Y 
for some r\ = (r] , . . . , r) M ) E F( M+1 ) n satisfying wt (rj) < ^j^-- 

Step 4: Put v := v — rj D j . 

Step 5: Go to Step 1. 
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Remark 6.3 For illustration purposes assume n = 2, i.e. the rate is 1/2. Theorem 16 . 21 then 
states that a strongly MDS (2, 1, <5)-code can be correctly decoded as long as there are no 
more than 5 errors in any sliding window of length 45 + 2. This has to be compared with a 
MDS block code of rate k/n where n = 2k = 45 + 2 which is capable of decoding correctly 
5 errors in any slotted window of length n. Similar comparisons can be made for different 
values of n. 

PROOF: We first have to show the existence of rj as required in Step 3 and the uniqueness 
of r] . In order to do so fix some j > 0. It is easy to see that for all w G ¥([D)) n with tiJ > 
one has 

i wHT )[j,j+M] = W[jj+M] (H C M ) T + w [Q j- X ]Hj (6-2) 

where 

- H s ■■■ H x - 

H j+ i ■■■ H 2 
Hj = . . 

Hj+M ■ • • H M+ i_ 

Since, due to the previous decoding steps we have V[oj-i\ = ^oj-i], which is the correct 
codeword sequence, we get 

= (vH T ) [jJ+M] = v [jtj+M] (H^) T - e\j, j+ M}{HM) T + v^j^Hjf . 

Again with ()6.2)1 this yields 

S = (vH T ) [jJ+M] = e [jd+M] (H c M Y 

and the assumption (jfj.ljl together with Proposition 16.11 establish the existence of rj as well 
as the uniqueness of r/ as required in Step 3. 

It follows directly from the above that r] = €j, where r] is computed in Step 3 of the j-th 
cycle. Thus we have V\ j] = up^-j after the j-th cycle. □ 

Remark 6.4 One might wonder how the algorithm above is related to the total error cor- 
recting bound t := [ ^ frc 2 c ~ 1 j of the code. First notice that = frg |~ x = ^k^. From this 
it follows that for each received word v there exists at most one codeword v G C such that 
v — v satisfies (|6.1|) . This codeword, of course, is then the result of the algorithm above. 
However, it might happen that there are two codewords v\, v 2 G C such that the total dis- 
tances satisfy wt(v — v\) = wt(v — v 2 ) = d(v, C) := min{wt(-0 — v) \ v G C}. Hence v\ and v 2 

have equally close distance to v when considered over the total length [0, oo). This of course 

d -l 

can happen only if d(v,C) > [-^ — J- From the above we know that at most one of these 
codewords can have an error vector satisfying (|6.1j) . In this situation the decoding algorithm 
will try to successively minimize wt((t> — ^Ob'.j+A/]) over all codewords v G C and j > 0. 
This situation arises for instance for the strongly MDS code C with parity check matrix 

H=[l + (3 2 D + (3 5 D 2 , 1 + p A D + {3 5 D 2 } G F 2 a [D] 2 (where (3 3 + (3 + 1 = 0) 
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given in Example I4.1f 1) and having free distance d l[CC = 6. In this case the received word 

v = (J3D + (3 5 D\ f3 3 D 2 + (3 2 D 3 ) 

satisfies wt(v) = wt(v — V\) — 4 = d(v,C) for the codeword 

v x = {l + pD + ^D* + (3 2 D 5 , 1 + [3 3 D 2 + (3 2 D 3 + (3 2 D 5 ) e C. 

Hence v is equally close to v\ and the zero codeword, but only v — v\ satisfies the error 
condition (J6.1j) . Therefore, the decoding algorithm will decode v into the codeword V\. 

The main step of the algorithm in Theorem 16.21 is. of course, the determination of 7] from 
the syndrome vector in Step 3. For codes with small parameters this can easily be achieved 
by simply checking (in a smart way) all linear combinations of at most columns of 

H*m- But for larger codes this is unsatisfactory and one would like to know an algebraic 
computation of r/ . Unfortunately, thus far we cannot offer such an algebraic decoding. It 
will certainly depend on an algebraic construction of superregular matrices along with their 
algebraic properties. 

We close this paper with the following criterion which, in the affirmative case, speeds up 
Step 3. It makes use of the systematic sliding parity check matrix of the code, see (|3.3|) . 
which can be used just as well in the decoding algorithm. Notice that there are n different 
systematic M-th sliding parity check matrices for an (n, n — 1,5) code. Therefore, the 
following criterion can be tested n times and hopefully leads to an immediate decision on rjo 
at least ones. 

Proposition 6.5 Let C C ¥((D)) n be a strongly MDS (n,n — 1,5) -code with systematic 
M-th sliding parity check matrix H as in ()3.3|) . Let S = (So, . . . , Sm) € ¥ M+1 be such that 
S = f)H T for some 

f) = (e , . . . , e M , f , . . . , f M ) e F^+^K- 1 ) and wttf) < ^±1 

Ifwt(S) < [^±1], then f = and e = So- 
Proof: The assumptions wt(S) < \^^-] and wt(f/) < ^1 imply that there exists 
a linear combination of at most M + 1 columns of H giving the zero vector. But then 
Theorem 13. II yields f = and e = S . □ 

7 Conclusion 

In this paper we introduced two new classes of convolutional codes called strongly MDS con- 
volutional codes and codes having maximum distance profile. Strongly MDS convolutional 
codes have the property that the generalized Singleton bound is attained at the earliest 
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possible column distance. Codes with maximum distance profile have a maximal possible 
increase of the column distances. 

It is shown that strongly MDS convolutional codes perform excellent under feedback 
decoding. The number of errors which can be iteratively decoded per time interval lets these 
codes compare with MDS linear block codes having a considerable block length. At this 
point the feedback decoding algorithm we presented is not powerful enough to practically 
decode strongly MDS convolutional codes when the degree 5 and the block length n are too 
large for the syndrome decoding step, see Theorem 16 . 21 and Proposition 16 . 5l for details. It will 
be a matter of future research to construct strongly MDS convolutional codes which come 
equipped with an algebraic structure and an efficient decoding algorithm comparable to the 
situation of Reed-Solomon block codes. The class of cyclic convolutional codes IU HH1 EH] 
might hold some promise here. 

From an applications point of view strongly MDS convolutional codes are particularly 
suited in situations where codes over large alphabets are required and in situations where al- 
gebraic decoding is desirable. Hadjicostis [TJ [HJ has recently demonstrated that convolutional 
codes over large alphabets are very desirable in areas of process control via linear finite state 
machines where large numbers of non- concurrent errors should be detected and corrected. 
It seems that strongly MDS convolutional codes have potential for such applications. 



Appendix 

We will prove that the proper minors of the matrix T n given in Example IH.9( 2) are all 
positive. In order to do so consider the matrix 



X 



1 

1 1 
1 1 



1 1 
1 1 



G Ti- 



ll 



and notice that for all k € {1, . . .,n — 1} we have 



X 



(!) i 
© © i 



© © i 
■■■ © © 



(A.l) 



1 © © lJ 

In particular, X™ -1 = T n . Therefore, the positivity of the proper minors is a consequence of 
the following theorem. 

Theorem A Let k e {1, . . . , n — 1} and 1 < %\ < . . . < i r < n, 1 < ji < . . . < j r < n and 
define X := (X k )^'-' i r . Then det X > and 

det X > <^=^> G {i;, ^ — 1, . . . , %i — k} for all I = 1, . . . , r. 



Proof: 1) We first show that 

ji € {h: k — 1) • • • ) k ~ k} for some / ^> det X = 0. (A. 2) 

To this end notice that 

Xij = for j > i or j < i — k 

and thus 

X ef = X iejf = for j f > i e or j f <i e - k. 

Assume now ji > i\ for some /. Then for all e < I and / > I we have jf > ji > i\ > i e and 
thus X e f = 0. Hence the first / rows of X have at most rank / — 1 and thus detX = 0. 
Similarly, if ji < i\ — k for some /, then we have X e f = for all e > I and / < / and the 
first / columns of X have at most rank I — 1. 

2) It remains to prove the implication u .^=" of the equivalence given in the theorem. 
We begin with proving the statement for k — 1, i. e. for the matrix X. In order to do so, we 
proceed by induction on r. For r = 1 we have to consider the submatrices X£ and XH_ V 
They all trivially have determinant 1. Now let r > 1. We suppose the statement is true for 
all (r — 1) x (r — 1) proper submatrices with the according restriction on the indices and we 
have to show that the assertion is also true for X = X l ^''fj r where ji e {k,k — 1} for all /. 
Notice that the first column of X has either one or two nonzero entries and they are equal 
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to 1. If the first column of X has one 1 only, then it is on the first row. Applying cofactor 
expansion along that column we obtain 



detX 



1 ■ det X' 2 ' 

J2 i 



(A.3) 



The (r — 1) x (r — l)-submatrix satisfies ji G {ii, ii — 1} for all I — 2, . . . , r and hence by 
induction has positive determinant. This proves detX > in this case. If the first column 
of Xjl''"j r r has two entries equal to 1, then they are necessarily on the first two rows, thus 
*2 = H + 1 and j\ = i\. Since j 2 G {«2,*2 — 1} = {ii + and j 2 > ji, we can only 

have J2 = it + 1. Then the first row will have only one nonzero entry equal to 1 on the first 
position, and applying cofactor expansion along that row, we obtain again (|A.3|) and thus 
detX > 0. 

We now proceed by induction on k in order to prove the desired result for X k where k > 1. 
Assume X k ~ 1 has the stated property. Using X k = X ■ X k ~ 1 and the Cauchy-Binet formula 
for minors we obtain 



detX 



E 



det XI 



det{X h 



-,Sr 
■Jr ■ 



"ie{ii,ii-i}ri{ji,ii + i,...,ji+k-i} 



Due to part 1) of the proof the sum indeed expands only over the given indices. By induction 
all nonsingular submatrices of both matrices X and X k ~ 1 have positive determinant, hence 
if there are any nonzero terms in the sum, it is necessarily positive. Therefore, the only 
thing left to be proven is that there is a nonzero term in the above sum. But all products of 



the form det X, 



det(X* 



-k— lyi,— ,v 

-%l,...,lr A 'h,—,jr 

nonzero. Thus det X > and the proof is complete. 



with ji G {ij, i t -l,ii — 2,...,ii — (k 



1)} for all I are 

□ 



Proof of Theorem 13. lUi Step 1: We will show the existence of a systematic sliding 
parity check matrix H as in (J3.3|) satisfying part (b) of Theorem 13.11 This can be accom- 
plished as follows. Let r := (M+ — 1) and pick a r x r-superregular matrix in Toeplitz 
form, say 



T 



"ti 


... o" 












T 2 




'•• 






U ■■■ 


t 2 tl 







[Ti, . . . ,T T ). 



Theorem 13.81 guarantees the existence of such a matrix over a suitably large field of charac- 
teristic p. Now define 



H 



1 



'-pn—l 
rp(M+l)(n-l) 



g jp(M+l)x(M+l)n 
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Notice that by construction H has the form as in (|3.3|) . We will prove by contradic- 
tion that this matrix satisfies part (b) of Theorem 13.11 In order to do so, write H = 
[ei, . . . , ejvj+i, Ti, . . . , T T ] and assume that % < n — 1 is the smallest index such that T is in 
the span of M other columns of H. Hence these other columns do not involve T\, . . . , Tj_i- 
This implies that there is a linear combination of M + 1 columns of the matrix [/ T ,T] 
with a nonzero coefficient for the column T and having a zero entry at the positions 
1, 2, . . . , i — 1, n — 1, 2(n — 1), . . . , (M + l)(n — 1). Since z < n — 1, these positions are 
indeed different and thus the weight of this linear combination is at most r — % + 1 — (M + 1). 
Consider now the matrix 



Y '■— [Ir—i+i I Ti, Tj + i, . . . , T T ] :- 



tr—i+l ' ' ' *l 



jp(r-i+l)x2(r-i+l) 



where Tj denotes vector of the last r 



i + 1 entries of 7}. Notice that superregularity 
of T implies superregularity of the matrix [T, , . . . , T T ] . The linear combination of M + 1 
columns of [J r , T] above now reads as a linear combination of M + 1 columns of K with a 
nonzero coefficient for the column Tj and having weight at most r — i + 1 — (M + 1). Hence 
picking a suitable set of (at most) r — i + 1 — (M + 1) standard basis vectors, we obtain 
that the column T is in the span of r — 1 other columns of Y. But this is a contradiction to 
Theorem 13.5( d) . 

Step 2: Having constructed a matrix H as in ()3.3)1 with the corresponding column condi- 
tion, we now establish the existence of an (n,n — l)-code having H as its M-th systematic 
sliding parity check matrix. In order to simplify notation write 



H 



h 



o 

ho 



h 



M 



hi 






h 
M 



M+1,2, • • • j HM+l,n 



where hi = (h 



i2: 



i hi n 



[ei, . . . , BM+1, Hl2, ■ ■ ■ , Hin, 

) G F n_1 . We have to find polynomials 

8 5 

-■ l + ^2a i D i e¥[D], b = ^ b i D i e ¥[D] 



n-l 



= 1 



i=0 



such that 



M 



hjD^ + higher powers 



(A.4) 



(A.5) 



(A.6) 



3=0 



(see Theorem 13. Recall that M = L^riJ + ^- If M = 5 (i. e. <5 < n — 1), we may simply 
take a = 1 and 6 = X^=o hiD 1 . Now let us assume M > 5. Comparing like powers of D 
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in (jA.6|) shows that the above requires in particular 



= hi + dihi-i + a 2 hi_ 2 + • • • + eights for any / > 8 
for suitable /&m+i> ^m+2> • • • G F n_1 . For I = 8 + 1, . . . , M these equations read as 



(A.7) 



(as, ■ ■ ■ , oti) 



hM-S+l hu-h 



hu-i 



hu-2 



hi 
h 2 

hx 



■(h M , . . . , /i<5 + i) . 



(Ai 



Notice that hi,...,hM are given data. If we can find a solution (as, ■ ■ ■ , a{) of (|A.8j) . 
then (|A.7j) can be established for all / > M + 1 by choosing /i; suitably. Thereafter, the 
vector polynomial b G F[D] n_1 can be computed by equating the coefficients of D°, . . . , D s 
in the equation b = (Y^iLohiD l )(l + Ylt=i a iD l ) ■ Hence it remains to consider (|A.8[) . This 
equation is solvable if 



rank7Y 



in 



l)(M-8), where H 



h 



MS 



h 



M-l 



hi 



In 



e F 5x(n-l)(M-5)^ 



Notice that p := (n — 1)(M — 5) = (n — 1)L^yJ < 5. We proceed by contradiction and 
assume rank 7i < p. Then there is a column of 7i that is a linear combination of the other 
p — 1 columns. Since Ti is a sub matrix of if (see f)A.4)0 and because of the specific structure 
of Ti, this yields that a column H\j, j = 2, . . . , n, is a linear combination of p — 1 + M + l — 5 
other columns of H. But 



M + p - 5 



6 



n 



+ 5+ (n- 1) 



6 



n — 1 



5 < 



n 



+ 5 = M, 



and thus we arrive at a contradiction to the column property of H (see (b) of Theorem 13. 1|) . 
Hence f)A.8|) is solvable and the existence of a and b as in (|A.5|) and (|A.6|) is established. 

Step 3: Put H = [a, &W, . . . , ftC* -1 )], where a G F[D] and b =: (&«, . . . , fo^ 1 )) G F[D] n_1 
are constructed as in Step 2). Moreover, let C = {v G ¥((D)) n \ vH T = 0}. It remains to 
show that C has degree 5, which amounts to showing that a, b^ l \ . . . , fe( n_1 ) are coprime and 



max{dega, degfe*- 1 ' 1 , . . . , deg^ n ^} = 5. 



(A.9) 



Coprimeness can be assumed without loss of generality since division by a common factor 
would lead to another solution of (jA.5|) and (jA.6|) . Hence H is basic. By construction and 
Theorem 13.11 the M-th column distance of C is given by d c M = L^zrJ +5 + 2. Since this 
number is strictly bigger than the generalized Singleton bound of any (n,n — 1, j)-code, 
where 8 < 8, Equation (|A.9|) follows immediately. 

Thus C is a strongly MDS (n,n — 1, <5)-code and the proof is complete. □ 
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